As we have discussed in class, many urban problems exacerbate the inequalities within an urban system. After over a year and a half of pandemic, Covid-19 still poses a threat to the population’s ability to health, travel, work, and gather, though some are more affected than others. It is no secret that Covid-19 is greatly affecting communities of color and impoverished communities, and these two groups have a lot of overlap. One such community of interest in examining the impact of Covid-19 on vulnerable populations is Oakland, because it contains zip codes with large differences in poverty levels as well as many racially segregated neighborhoods. Oakland is a high impact city to look at because of its large and diverse population. With over 433,000 residents, the city has a relatively equal split between White, Black, and Hispanic/Latino population with a strong Asian presence as well (source 4). In addition, the poverty rate is 16.7%, which is over 5% higher than the national average.
With Oakland’s demographic, Covid-19 has impacted populations in a disproportionate way. One example of this is seen by comparing the zip codes 94603 and 94618, which are on opposite sides of Oakland. Zip code 94603 has 30% of children living below the poverty level and the highest Covid rate in the city whereas zip code 94618 has 4% of children living below the poverty level and the lowest Covid rate in the city (source 1). Another key insight about this example is that zip code 94618 is 77% White while zip code 94603 is majority Black or African American (source 2). Furthermore, at the City of Oakland Racial Disparities Task Force Town Hall it was mentioned that in Alameda County, Latinx make up 22% of the general population and 46% of the COVID-19 caseload and African-Americans make up 10% of the general population and 14% of COVID-19 cases (source 3). These startling statistics clearly indicate further need to prioritize researching the impact of Covid-19 on the basis of both race and poverty. Local leaders in Oakland have announced how extremely worried they are about these racial disparities in people of color as well as low-income people, immigrants, people with disabilities, and others (source 5). There are many factors that could also be indicative of Covid-19 impact, risk, and response, so we were also curious to see how the numbers might also relate with covid testing rates, which could alert community members to infection and allow them to take the necessary precautions to avoid further spread. We plan on finding relationships between these variables as well as map them geographically to provide some insights on Covid-19 in Oakland as well as predict future Covid-19 rates. For this reason, we decided to use Census Data for income and race and ArcGIS Hub data that records Alameda County COVID-19 Cases and Case Rates by Zip Code (https://hub.arcgis.com/datasets/5d6bf4760af64db48b6d053e7569a47b_0/explore?location=37.679493%2C-121.905640%2C10.88, https://hub.arcgis.com/datasets/5d6bf4760af64db48b6d053e7569a47b/explore?layer=4&location=37.679103%2C-121.905640%2C10.88 ).
Our ultimate goal was to create a dashboard that makes it easy to view Oakland zipcodes in terms of the presence of a chosen income level, race, COVID testing rate, and COVID case rate. Some key insights we hoped to find were 1) how the amount of COVID testing correlates to covid cases (ie. does testing seem to actually have a strong relationship with covid case rate as many say), 2) does access to testing appear equal on the basis of racial or economic background (ie. what does accessibility seem like?) and 3) also evaluating the disparities in Oakland more generally. One issue to note was that due to having data only as granular as the zipcode level for Oakland, it was not statistically significant to only provide regression data for the variables based on Oakland zipcodes, so our overall analysis of the trends seen in Oakland through our graphs had to be supplemented with regression results for Alameda county, which does add a variable of inconsistency, but we still felt we were able to draw informative conclusions. Our reflections with specific instances of graphs we wanted to point out are presented below, but the link to our dashboard can be found at the top of this report.
COVID Dataset from Alameda County COVID-19 Case/Case Rates by Zip Code GeoJSON API URL: “https://opendata.arcgis.com/datasets/5d6bf4760af64db48b6d053e7569a47b_0.geojson”
COVID Dataset from Alameda County COVID-19 Test Rates by Zip Code GeoJSON API URL: “https://opendata.arcgis.com/datasets/5d6bf4760af64db48b6d053e7569a47b_4.geojson”
My first major assumption is that the breakdown of race in this visual is representative of the breakdown of population in Oakland. Another assumption is the validity, completeness, and accuracy of the data set used. This data set is gathered and produced by the US Census, so we are assuming it is from a credible source on the topic and was gathered in a fair and unbiased way.
We noticed that when trying to see the relationship of COVID cases versus COVID testing that outliers were present and make the results less relevant, so we wanted to find which zipcodes caused these outliers and removed them to allow the results to be more accurate
##
## Call:
## lm(formula = CaseRates ~ TestRates, data = alameda_grouping_by_zip1)
##
## Residuals:
## Min 1Q Median 3Q Max
## -6030.2 -2471.6 -298.1 1952.1 11263.8
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 5.784e+03 5.741e+02 10.074 1e-13 ***
## TestRates 1.601e-01 5.631e-02 2.842 0.00643 **
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 3796 on 51 degrees of freedom
## Multiple R-squared: 0.1367, Adjusted R-squared: 0.1198
## F-statistic: 8.079 on 1 and 51 DF, p-value: 0.006426
From this we can identify there are some zipcodes with data points that deviate far from the rest of the data points, specifically 94720, 95377, 94621,94613 and 94603, so we will remove these frames from the regression to help improve it
##
## Call:
## lm(formula = CaseRates ~ TestRates, data = alameda_grouping_by_zip1)
##
## Residuals:
## Min 1Q Median 3Q Max
## -5390.5 -2214.7 -19.1 1527.7 7656.7
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 8665.7717 1187.5665 7.297 3.27e-09 ***
## TestRates -0.9230 0.3856 -2.394 0.0208 *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 2794 on 46 degrees of freedom
## Multiple R-squared: 0.1108, Adjusted R-squared: 0.09143
## F-statistic: 5.73 on 1 and 46 DF, p-value: 0.02082
As we can see, removing the outliers changed the trend of the graph, and so now we are ready for analysis.
##
## Call:
## lm(formula = estimate ~ CaseRates, data = alameda_grouping_by_zip2)
##
## Residuals:
## Min 1Q Median 3Q Max
## -425.59 -154.16 -43.57 104.49 834.73
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) -15.69792 94.30960 -0.166 0.869
## CaseRates 0.06805 0.01417 4.803 1.7e-05 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 284.7 on 46 degrees of freedom
## Multiple R-squared: 0.334, Adjusted R-squared: 0.3195
## F-statistic: 23.07 on 1 and 46 DF, p-value: 1.696e-05
Sources: https://calmatters.org/health/coronavirus/2021/06/california-covid-inequality-oakland-rockridge/ https://www.unitedstateszipcodes.org/ https://www.accfb.org/how-covid-19-is-affecting-communities-of-color/ https://www.census.gov/quickfacts/oaklandcitycalifornia https://www.oaklandca.gov/news/2020/local-leaders-announce-covid-19-racial-disparities-task-force